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Abstract 

In this paper, a new mixed Poisson distribution is introduced. This 
new distribution is obtained by utilizing mixing process, with Poisson dis¬ 
tribution as mixed distribution and Transmuted Exponential distribution 
as mixing distribution. Some distributional properties like unimodality, 
moments, over-dispersion, Taylor series expansion of proposed model are 
studied. Estimation of the parameters using method of moments, method 
of moments and proportion and maximum likelihood estimation along 
with data fitting experiment to show its advantage over some existing 
distribution. Further, an actuarial applications in context of aggregate 
claim distribution is discussed. Finally, we discuss a count regression 
model based on proposed distribution and its usefulness over some well 
established model. 
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1 Introduction 


Recently, count data have drawn attention of many researchers working in dif¬ 
ferent area of Insurance, economics, social sciences and biometrics. For this 
purpose, traditional models like Poisson, Negative binomial. Geometric and 
their generalizations were used. But often it has been found that count data 
exhibits over-dispersion (variance > mean) and long tail behaviour. Hence there 
is further demand to modify/generalize these traditional models in encounters 
such problems. In last two decades, many attempts have been made by the re¬ 
searchers to develop new models, one such method which has been widely used 
to model count data is mixture of the distribution which have been widely used 
for modelling observed situations whose various characteristics as reflected by 
the data differ from those that would be anticipated under the simple compo¬ 
nent distribution, Karlis and Xekalaki (2005). 

Many count-data models proposed by mixing Poisson parameter (A) with var¬ 
ious continuous distribution which overcomes the problems of under or equi- 
dispersion, for development in literature of Mixed Poisson distribution see the 
references below 


































Poisson-Gamma (Negative Binomial) -Greenwood and Yule (1920) 
Poisson Beta with specific parameter values (Yule) -Simon (1955) 
Poisson Beta Type-2 -Gurland (1958) 

Poisson-Exponential Beta -Pielou (1962) 

Poisson Truncated Poisson -Patil (1964) 

Poisson Beta Type 1 -Holla and Bhattacharya (1965) 

Poisson Truncated Gamma -Bhattacharya(1966) 

Poisson Linear Exponential -Sankaran(1969) 

Poisson Bindley -Sankaran (1970) 

Poisson Power Function -Rai (1971) 

Poisson Lognormal -Bulmer (1974) 

Poisson Generalized Inverse Gaussian -Sichel (1974) 

Poisson Inverse Gaussian -Sichel(1975) 

Poisson Gamma Product Ratio(Generalized Waring) -Irwin (1975) 
Poisson Generalized Pareto -Kempton (1975) 

Poisson-Poisson Distribution(Neyman) -Douglas (1980) 
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• Poisson Pearson’s Family of Distribution -Albrecht (1982) 

• Poisson Generalized Gamma -Albrecht (1984) 

• Poisson Truncated Beta Type 2 -Willmot (1986) 

• Poisson Log-Student -Gover and O’ Muircheartigh (1987) 

• Poisson Shifted Gamma -Ruhonen (1988) 

• Poisson Exponential(Geometric) -Johnson et al. (1992) 

• Poisson-Other Discrete Distribution(Neyman) -Johnson et al. (1992) 

• Poisson Linear Exponential -Kling and Goovaerts (1993) 

• Poisson Inverse Gamma - Willmot (1993) 

• Poisson Truncated Gamma -Willmot (1993) 

• Poisson Pareto -Willmot (1993) 

• Poisson Shifted Pareto -Willmot (1993) 

• Poisson Modified Bessel -Ong and Muthaloo (1995) 

• Poisson-Power Variance -Hougaard et al. (1997) 

• Poisson-Lomax -Al-Awadhi and Ghitany (2001) 

• Poisson Bindley Distribution(Size Biased) -Al-Multairi (2008) 

• Zero Truncated Poisson-Lindley -Ghitany et al. (2008) 

• Poisson Generalized-Lindley -Mahmoudi and Zakerzadah (2010) 

• Poisson-Lindley-Beta -Gomez-Deniz et al. (2014) 

• Poisson-Marshall-Olkin-Generalized Exponential -Gomez-Deniz et al. (2015) 

The above monographs presents more flexible distributions as they can be 
used as building blocks for improving count data models. In the present pa¬ 
per, a new discrete distribution is obtained by mixing Poisson distribution with 
Transmuted-Exponential distribution proposed by Shaw and Buckley (2007). 

To the best of our knowledge, we have not come across any literature on discrete 
distribution, particularly mixed Poisson distribution, where transmuted family 
is being used, except Charkraborty and Bhati(2015) who proposed the discrete 
version of Transmuted exponential family. 

The rest of the paper is structured as follows. Section 2 describes the the¬ 
oretical development of the new count distribution, including some properties, 
different methods of estimation are shown in Section 4. An actuarial application 
of the proposed model is examined in Section 5. Finally in Section 6, applica¬ 
tion of the proposed model in count data analysis and count regression model 
is presented. Some comments and conclusions are drawn in Section 7. 
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2 Proposed model 


Shaw and Buckley (2007) proposed a novel technique to introduce skewness 
and kurtosis into symmetric as well as to other distribution. In this techniques 
they use “transmutation map”, which is functional composition of cumulative 
distribution of one distribution with the quantile function of another. One such 
member of this transmuted family is “Transmuted Exponential Distribution” 
{TS'D{a,9)) whose density function given as 

fo for X < 0, 

I a9e~^^ + 2a9e~^^^ for x > 0, 


with |q;| < 1 and 9 > 0. 

Considering the fact that {T£'D{a,9)) possess wide range of statistical proper¬ 
ties as compared to exponential distribution, we introduce a new mixed Poisson 
distribution assuming (l~S'D{a,9)) as prior distribution for Poisson parameter 
(A). The dehnition of purposed model is as follows 


Definition 1 A random variable X is said to follow Poisson-Transmuted Ex¬ 
ponential distribution if it possess following stochastic representation 

A:|A~ Po{X) 

X\a,9^ T£V{a,9) 

for A > 0, |q;| < 1, d = 1 — a and 9 > 0. We denote unconditional distribution 
of X by VT£'D{a,9) and its pmf is given theorem 1. 


Theorem 1 If X ^ 'PT£'D(a, 9), then probability mass function (pmf) of X is 

a 2a 


P{X = x;a,9) = 9 
with |a| < 1 and 9 > 0. 


(l-b6»)^+i (l-f26»)®+V’ 

Proof: If X\X ~ Pois{X) and A ^ T£'D{a,9) then 

OO 

P{X =x)= 9 j P{X = x|A)/a(A; a, 9)dX 


X — 0 , 1 ,, 




x\ 


, ee-^^{a + 2ae-^\)d\ 


OO 

= ^ y e-^A^ (d -b 2ae-®A) e'^^dA 


2a 


(1-b 0)^+1 (1-b 26»)^+i y ’ 


( 2 ) 
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where |a| < 1 and 6 > 0 . 


Remarks 


i It is easy to see that the p.m.f. in ([^ can be rewritten as a finite mixture 
of two geometric distribution in the form, 


Px = a Geom 


i + e 


+ a Geom 


1 


1 + 20 


(3) 


ii 


For Of —^ 0 pmf (2) reduces to Geometric distribution Geo 



iii 


For Of —1 pmf (2) reduces to Geometric distribution 



Further, the cumulative distribution function and the survival function of X ^ 
VT£V{a,0) can be given as 


and 


Fx (x) = P{X <x)=J2^ 


n—0 
=1 - 


2a 


(1 + (1 + 20)"+i 

a a 


(1 +0)^+1 (l + 20)^+i 


{x) = P{X >x) = 


(4) 

(5) 


Figure 1 shows the pmf of VTSPia, 0) for different values of a, 0 which confirms 
the unimodality of random variable, moreover, this can proved mathematically 
in proposition 1. 


Proposition 1 The Proposed distribution with p.m.f. & is unimodal. 

Proof: The probabilities defined in ([^ can also be computed by following re¬ 
cursive relation 


with 


p (a; + 1) 


1 

TT0 


/ \ x-\-2 I 

a + 2a ^ 1+261 j J 


p(x) 


_ 0 (1 + Q + 20) 

^ (1 + 0 ) (1 + 20 )' 


( 6 ) 


Moreover, it can be observe that possess relation 


Px+l>Px V x<x*, 

Px+l<Px V x>x*, 


(7) 
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Figure 1: PMF plot for different values of a and 9. 
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for a;*(mode of VTST) r.v.) depending on parameters a and 9, which indicate 
the unimodality, see Keilson and Gerber (1971). 

Further, 

2 2a{a)9‘^ 

Px-Px+lPx-l = 

For —1 < a < 0, the distribution is infinitely divisible since —px-iPx+i < 
0 and po 7 ^ 0, pi 7 ^ 0, see Warde and Katti (1971) for details. Then, in this case 
the distribution has its mode at zero. The fact that {px/px-i} , x = 1, 2,..., 
forms a monotone increasing sequence in this case requires that {px} be a de¬ 
creasing sequence (see Johnson and Kotz, 1982, p.75), which is congruent with 
the zero vertex of the new distribution for — 1 < a < 0. Moreover, as any 
infinitely divisible distribution defined on non-negative integers is a compound 
Poisson distribution (see Proposition 9 in Karlis and Xekalaki, 2005), we con¬ 
clude that the new pmf presented in this paper is a compound Poisson distri¬ 
bution. 

Furthermore, the infinitely divisible distribution plays an important role in 
many areas of statistics, for example, in stochastic processes and in actuarial 
statistics. When a distribution G is infinitely divisible then for any integer 
X > 2, there exists a distribution Gx such that G is the x-fold convolution of 
Gx, namely, G = G*®. Also, when a distribution is infinitely divisible an upper 
bound for the variance can be obtained when — 1 < a < 0 (see Johnson and 
Kotz, 1982, p.75), which is given by 

var{X) < —. 

Po 

Some implications of infinite divisibility are as follows: 

1. For k, m = 1,2,..., 

/k + m\ 

1 ]Pk+mP0 ^ PkPm- 

\ m J 

See Steutel and van Harn (2004), p. 51, Proposition 8.4. 

2. For all /c = 0,1,..., Pa; < exp(—1). See Steutel and van Harn (2004), p. 
56, Proposition 9.2. 

3. The cumulants of an infintely divisible distribution on the set of non¬ 
negative integers (as far as they exist) are non negative, see Steutel and 
van Harn (2004), p. 47, Corollary 7.2. This will imply that the skewness 
of the new distribution is positive, since the third cumulant equals the 
third central moment. 

4. The distribution is strictly log-concave and strongly unimodal, see theorem 
3 in Keilson and Gerber (1971). 
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As indicate in above remark, since ([^ is log-concave and zero vertex for some 
value of parameter a and 6, we can determine the mode(a;*) of VTS'D{a,9), 
which will be determine such the P{X = x) is increasing on (0,1, • • • ,x*) and 
decreasing on (a;* -I- 1, • • •). It can be easily verified- 

1. For — 1 < a < — ^ , the pmf J 2 I is unimodal and the mode is at 

xo = 0. 


2. For — 


(i+ 2 ey 

3+4S 


< a < 1, the mode of pmf ( 2 ) will be at x* + 1 , with 
X* = [logg. ~2], where 9* = Here [.] denotes the integer 

part. 


3. Further if [logg. ~ 2] is an integer, then pmf (|^ will be bimodal 

and the mode is at x* and x* + \ respectively. 


Moreover, the new distribution is as a mixed Poisson distribution it has a heav¬ 
ier tail than a Poisson distribution with the same mean. Now, let Pr(A' = x) be 
the probability function ([^ and Pr(X|m) be the probability function of a sim¬ 
ple Poisson distribution with the same mean, say m. Then, as shown by Feller 
(1943), Pr(A: = 0) > Pr(0|TO) andPr(A: = l)/Pr(X = 0) < Pr(l|TO)/Pr(0|TO) = 
m. The asymptotic tail behavior of Poisson distributions was studied for Will- 
mot (1990). 

In the following results, we present various properties such as Taylor Expansion 
for VTE probabilities , probability generating function, moments, hazard func¬ 
tion of VT£{a, 9) 


Taylor expansion oi 'PT£{a,9) probabilities 


Ong(1995) gave the Taylor series expansion of mixed poisson distribution which 
is stated as: 

Let g{x) be the probability density function (pdf) of the mixing distribution of 
a mixed Poisson distribution. If g(x) has a finite derivative at the point 
fc > 1 for all n, then the mixed Poisson pmf P{k) has the formal expansion 




/(")(fc) 


( 8 ) 


where f{x) = xg{x), f^^\x) = and gi is the moment about the mean 

of the gamma random variable X with scale and shape parameters 1 and k 
respectively. 

In the proposed model, considering f{x) = xg{x) = x9e~^^ {a + ae~^^), and 
f^’‘\x) = ((—l)*“^e®^(d(i — Sx)-I-(—l)*“^2*“^a(f — 20x)) and using 

(7), the probability of VT£T>{a, 9) can easily be obtained. 











Proposition 2 The probability generating function(pgf) of random variable X 
defined in ^ is given as 


p ^ ^(1 - t)(l + a) + 26>^ 

’ ~ {l-t + 0){l-t + 29)' 


(9) 


Proof: The proof is straight forward after using the definition Px{t) = E{t^) 

and ([^. ■ 

Proposition 3 The raw moment of VT£T>{a,9) is given by 


E{X'^) 


0(d)$(^,-r,O) 20a$(^,_r,O) 

1 + 9 ^ 1 + 29 


( 10 ) 


where $(z, s, a) = ^ z^{k + a) ^ is Hurwitz-Lerch transcendent function. 
fc=0 

In particular, the first four raw moments of X can be obtained easily by 


putting r=l,2,3,4 in (10) and are as follows 


Ml = 


2-a 


M2 =- 


29 ’ 

4 - 3a + 26» - 
^02 ’ 


Ms =- 


M4 =- 


24 + 240 + 402 _ 21a - 180a - 29^a 
403 ’ 

48 + 720 + 2802 + 203 - 45a - 630a - 2102 _ 

204 ’ 


whereas other measures like variance(/i 2 ), coefficient of skewness ( 71 = 

V ^2 

and kurtosis ^72 = are as follows 
4 + 20(2-a)-a(2 + a) 


M2 =- 


7i = 


402 

2(8 + 40(3 + 0) - 3a - 20a(3 + 0) - 3a2(l + 0) - a3) 


( 11 ) 

( 12 ) 


72 =- 


[4 - 20(a - 2) - a(a + 2)]^^^ 

16(1 + 0)(9 + 0(9 + 0)) - 8(1 + 0)(9 + 0(12 + 0))a - 8(6 + 0(9 + 29))a^ - 12(1 + 9)a^ - 3a4 


(4- 20(a- 2) - a(a + 2))2 


(13) 


In figure 2 the contour plot of Mean, Variance, Skewness and Kurtosis for pa¬ 
rameters a and 0 were shown. 

Further, the coefficient of variation(C.V.) of the distribution comes out to 
be 

V4 - 20 (a - 2 ) - a(a + 2 ) 

H 2 — a 
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Figure 2: From left top-in clockwise direction we present contour plot of Mean, 
Variance, Kurtosis and Skewness of VTS'D{a, 9) 
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3 Steps to generate VTST>{a^9) random variate 

As the proposed model is derived from mixed Poisson distribution, following 
algorithm can be used to get VTEV{a,9) random variable 


Algorithm 

Step 1 Generate Ui from {7(0,1). 

Step 2 Corresponding to each m, determine Ai = ^ log ^ i-a+VU-2a-4uia± ^'^ ^ 
Step 3 Hence, generate Xi from Poisson{Xi). 

4 Estimation 

In this section we discuss three methods to estimate the parameters a and 6, 
In the first subsection the method of moment is presented, further in second 
section, method of proportion and moment is discussed and finally Maximum 
Likelihood method is shown as last subsection. 


4.1 Method of moments 


Given a random sample xi,X 2 , - ■ • , of size n from (1), the moment estimates, 
a and 9, oi a and 9 can be obtained by solving the following equations 


TOi = ^1 = 


2-a 


and 


m2 = fJ,2 = 


4: - 3a + 29-a9 

■ 


(14) 


where mi and m 2 are the first and second sample moments. Solving the above 
equations, we get 


3to? mi \/ 4mi + 9m? — 4m2 

a = 2 H-1-^ AV-1- 1 -7. 

mi — m2 mi — m2 

—3mi — yj 4mi + 9mf — 4m2 


9 = 


2(mi — m 2 ) 


(15) 


Theorem 2 For fixed a, the estimator 9 of 9 is positively biased, i.e. E(9) > 9. 
Proof: Let 9 = g{X) and g{t) = for t > 0. Then 

,,, , 2 — a 

9 9) = > 0- 

Therefore, g{t) is strictly convex. Thus, by Jensens inequality, we have 
E {g{X)) > g (E{X)). Finally, since g (E{X)) = g = 9, hence we obtain 

E{9) >9. U 
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4.2 Method of proportion and moment 

In this method, we compare the sample proportion of zero(po) and sample mean 
(x) with the population proportion of zero and population mean, i.e, estimated 
a and 0 will be obtained by solving the following two equations 


P[)= 9 


2a 


l + 6 » 1 + 261 


and 


X = 


2-a 

~W' 


(16) 


Thus, solving the above two equations, we obtain the estimates a and 6 as 
follows 


a4(4- 


3a; 


3a;po a;^9 - lOpo - 8a;po + pI 


X +po — I X — I 


x + pq - I 


e = 


3 — 3po + \/9 — lOpo — &xp + 


pI 


4ix+po - 1 ) 


(17) 


4.3 Maximum likelihood estimation 

Let a;i,a; 2 ,--- ,Xm be an random observation of size m from our proposed 
Poisson-Transmuted exponential distribution. The log-likelihood function for 
the vector of parameter 0 = {a, 9)^ can be written as 


l{a, 9\x) = m\og9 + E log 


a 

(1 + 6 »)^*+i 


(1 


2 a 

26»)®-+i 


The normal equations can be obtained by taking the first derivative with 
respect to both parameters. These equations are given by 


m 

dl 

da 


n 


(a; + 1) ( 


(a) 


4oc 


(1 + 0)03 + 2 T (1 + 20 )^ + 2 


_^_ 2 a_A 

x=0 (^(l+e)x + l -I- (i+ 2 e)x+i J 


=E 


m ^ — 


_1_L _2_ 

{i+0y+^ ^ (i-i-2e)“’+i 


- f (“) I 2 a 

V(n-e)“^+i -I- (i+2e)^+i 


= 0 . 


= 0 , 


(18) 


(19) 


The solution of above two equations gives the maximum likelihood estimator 
of parameter a and 9 and can be solved numerically or direct numerical search 
for global maximum of the log likelihood surface. On may also use nlm() function 
in open source R-software for determine the global maxima. Further, the second 
order derivative of log-likelihood function are 


d'^l 

da"^ 



20 


ii+0r 


( 1 + 20 )^ 


\ ( (l-a)0 2ae 

' I (l-|-e)“’i + l ^ (l + 20 Yi + '^ 


( 20 ) 
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d^l 

dOda 



(l+e)x, + l + (1+20)^.+! + (l+e)^i+'i + 1) (l+2e)“=. + 2 + 1) 


(l-a)e . 2a0 

(l+e)“=i+i (l+ 2 e)“=i+i 


9 - 


(i+e)^i+i ^ (1+26)' 


-)( 


{a)(l — 9xi) I 2a(l — 20Xi)' 
(l+0)^i+2 + (l+ 2 e)‘“i +2 


/ (a)e I _ 

\^(i+e)“=i+i (i+ 2 e)“=i+i 


2 ae 


, ( 21 ) 


d^l n 


( (i+e)x(+ 2 (l - Ox,) + (i+ 2 e^x .+2 (1 - 26»a;*)) 


902 02 




( 


(l-a)0 . 2a0 

(l+0)^i + i (l+20)^i + ^ 


(a^i + 1) ^ (2 0a;i) + (i_|_2e)i^i+3 (1 Oxi)^ 


{l-a)0 , 2a0 

(l+e)i+xi -I- (i+ 20 )i+^i 


( 22 ) 


The fisher information matrix can be computed by using the approximations 

/ 92 ; X / q 2 i X 

'v9a90; ^'v3a90;.^’ 


•92/\ /92; 

89^ J I 302 


Q :,0 


where a and 0 be the maximum likelihood estimator of a and 0. 


5 Collective risk model 

In non-life insurance portfolio, say, motor insurance, the aggregate loss (S') is a 
random variable defined as sum of claims incurred in a certain period of time. 
Let us consider now the following actuarial model. Let X be the number of 
claims in a portfolio of policies in a time period. Let 1^, z = 1, 2,... be the 
amount of the z-th claim and S = Yi+Y 2 + ----\-Yx the aggregate claims gener¬ 
ated by the portfolio in the period under consideration. As usual, two fundamen¬ 
tal assumptions are made in risk theory: (1) The random variables yi,y 2 ) • ■ • 
are independent and identically distributed and follows a discrete (continuous) 
distribution with pmf (pdf) h{y) and (2) The random variables A,yi,y 27 --- 
are mutually independent. The distribution of the aggregate claims S is called 
the compound distribution and assuming that L), z = 1,2 ,...A are discrete 
random variables, the pdf of S is fs{y) = Y^^=oPxh*'^{y)j where h*^{-) denotes 
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the x-fold convolution of h{-) and Px is given in ([^. There exists an extensive 
literature dealing with compound mixture Poisson distributions see Willmot, 
(1986,1993) and Antzoulakos and Chadjiconstantinidis (2004). An extensive 
review of the topic can be found in Sundt and Vernic (2009). Based on the re¬ 
cursion provided by Panjer (1981) for the Poisson distribution, Sundt and Vernic 
(2009, chapter 3, p.68) developed a simple algorithm to provide the probabilities 
of the random variable S when the amount of the single claim follows a discrete 
distribution with pmf h(x). 

From the Panjer (1981) recursion for the total claim amount when the pmf of 
the Poisson distribution is assumed as the distribution of the number of claims 
is given by 

fsix\X) =-'^yh{y)fs{x - y\X), x = l,2,... (23) 

X ^ 


while /s(0|A) = e Following Sundt and Vernic (2009, p.68) we get, after 
multiplying in (23) in both sides by A*/(A) and integrating we have that. 


fsiv) = - ^yKx)fs 
y ^—1 


'i+l 


(y-x), z = 0,1,.. .; 2 /= 1,2,... 


where fg{y) = X^fs{y\X)f{X)dX. Now, starting with /s(0) = po, the prob¬ 
abilities fs{l), fs{2),... can be evaluated by the algorithm described in Sundt 
and Vernic (2009,p.68) and having into account ([^. 


For more detail on classic risk model, see Freifelder (1974), Rolski et al. 
(1999), Nadarajah and Kotz (2006a and 2006b) and reference therein. Here, we 
consider two such situations: In first situation, the primary distribution is as 
defined in Section 2 and claim severity distribution as exponential distribution 
with parameter (a) and as we know, Erlang loss distribution may arise in in¬ 
surance settings when the individual claim amount is the sum of exponentially 
distributed claims hence in second situation. Erlang distribution with parame¬ 
ters r and a is considered as secondary distribution. 


Theorem 3 If we assume a Poisson Transmuted Exponential distribution with 
parameter {a, 9) as primary distribution and and an Exponential distribution 
with parameter (A) as secondary distribution, then the pdf of aggregate loss ran- 

X 

dom variable S = ^Yi is given by 

i=0 


fs{y) 


-eXy -26Xy T 

aAe 2Q:Ae i+^e 


{\Y0f (1 + 20 )" ’ 


for y>0, 


whereas, 


/s(0) = 9 


a 

(1+^) 


2a A 

(l + 20)j- 
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Proof: By assuming that the claim severity follows an exponential distribution 
with parameter A > 0, since the fold convolution of exponential distribution 
is gamma distribution with parameter m and A, the n*^fold convolution is given 

by 


r"(2/) = = 1,2,... 

(x- 1)! 

Then the pdf of random variable S is given by 

V(l + ^)“+^ (l + 26 ')^+V 

= 0 ( ^ -I- 2a 


yX-l X 


= de-^y 


aAei+« 


Xy 

2aAeT+2« 


[(1 + 0)" (1 + 20)"J 


■ _ . -‘>^v 
aXe 1+0 


- 2 eXy 

2aXe 1+20 


(1 + 0 )" (1 + 20 )^ 


Theorem 4 If we assume a Poisson Transmuted Exponential distribution with 
parameter (a, 0) as primary distribution and an Erlang distribution (2, A) with 
parameter (A) > 0, as secondary distribution then the pdf of aggregate loss ran¬ 
dom variable S = X]i=o ^2/ 


with 


fsiy)=X0e-^y 



y (1 + 0 ) 3/2 


(1 + 20)3/2 

/ 


/s(0) 


(1 + 0 ) 


2a A 


Proof: By assuming that the claim severity follows Erlang (2,A) distribution, 
then the fold convolution of Erlang distribution is gamma distribution with 
parameter (2n, A) .The fold convolution is given by 


riy) = 


A2 


(2cc- 1)! 


y^-^e-^y,x=l,2,..., 


Then, the pdf of the aggregate random variable S is given by 
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fs{y) = 


a 


2a 


\ yxy2x-l^-\y 


=ee-'’Y. 

x—1 

( 


(1+ 0)^+1 (l + 26»)^+V (2a;-1)! 

2 a:-i 2a 


_ ^ _ \ 2 x y _ _ 

(1 +0)^+1 (2a;-1)! (l + 26»)^+i 


A" 


=\ee-^y 


I sinh 


(v(i+«) 


yA 


2a sinh 


(v(l+2») 


yA 




V 


( 1 + 61 ) 3/2 


(1 + 20 ) 3/2 


{2x-l)\l 


6 Data analysis 

6.1 Count data modelling 

In this section, the applicability of Poisson Transmuted Exponential distribu¬ 
tion has been shown by considering a data set representing epileptic seizure 
counts considered earlier by Albert(1991) and Hand et. al.(1994) p. 133 and 
has compared with following distributions namely 


i Generalized Poisson-Lindley Distribution(57^£(a, 0))(Mahmoudi and Za- 
kerzadeh(2010)): 

r(a; + a) 0“+^ 


f{x;a,e) = 


a;!r(a + l) (1 + 0)1+^+^= 


a + 


X + a 
1 + 0 


ii Weighted Generalized Poisson Distribution {yVQV'D{a, s, 0))(Chakraborty 

( 2010 )): 

G(u) = y , where u = 


K{a, s, b) 
iii Poisson Distribution (7^(A)): 

e 




P{x)=^-^,X>Q. 


x\ 


iv Negative Binomial Distribution {JVB{r,p)): 

^x + r — 1 


fix) = 


p^{l-p)'^,r> 0,p e (0,1). 


V A New Generalized Two Parameter Poisson-Lindley Distribution {NQVC{a^ 0)) 
Bhati et al. (2015)): 


fix; a, 9) = 


( 0 + «)(! + 0)^+1 


a{x + 1) 

(1 + 0) 


a; = 0,1,... 
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Table 1: Distribution of epileptic seizure counts 



Observed 



Expected frequency 



Count 

Frequency 

P 

MB 

wgv 

gvc 

Ngvc 

VTS 

0 

126 

74.94 

91 

118.11 

121.51 

122 

121.925 

1 

80 

115.71 

86.6 

95.81 

92 

91 

91.6166 

2 

59 

89.34 

63.37 

59.89 

59 

58.74 

58.5609 

3 

42 

46 

42.57 

34.49 

35.1 

35.22 

34.7734 

4 

24 

17.75 

27.6 

19.24 

20.1 

20.52 

19.8403 

5 

8 

5.48 

17.6 

10.59 

11.18 

11.22 

11.0557 

6 

5 

1.41 

10.5 

5.81 

6.1 

6.39 

6.07055 

7 

4 

0.31 

6.52 

3.18 

3.3 

3.25 

3.30188 

8 

3 

0.06 

5 

3.88 

2.71 

2.5 

3.85564 

Total 

351 

351 

351 

351 

351 

351 

350.99997 


parameter 

A = 1.544 

f = 1.757 
p = 0.463 

a = 1.089 

h = 0.295 

s=-l 

0=1.139 

d = 1.292 

6 = 1.116 

a = 2.906 

a = -0.701 

6 = 0.873 


log likelihood 

-636.05 

-595.22 

-595.83 

-594.61 

-594.48 

- 594.85 


chi-square 

256.54 

22.53 

7.12 

5.94 

5.75 

5.36 


6.2 Count regression including covariates 

In last two decade researchers have contributed significantly in the area of counts 
regression modelling, for example Duan et al.(1983), Christensen et al.(1987), 
Cameron et al.(1998), Cartwright et al.(1992) and Deb and Trivedi(1997), Gomez- 
Deniz (2010). Here, in this section, we present the application of the pro¬ 
posed distribution in modelling the situation when the count variable {Yi) also 
called response variable depends on one or more than one exogenous variable 

In order to achieve this goal, let us consider the following re-parametrize 

v = 2 — a and 9= —, (24) 

2/x 

where, /ii is the mean of the response variable (Yi) is related with set of inde¬ 
pendent variables with log-link function log (/i^) = xj[3, for i = 1,2,..., s and (3 
is a vector of unknown regression coefficient. 

The above link function will ensure the positivity of /ii and after re-parametrization, 
the log-likelihood of the new model including covariates will be written as 


n 

= y^log 

2 = 1 


2 /^* ( 1 + 2 ^) 


(2 — i^)i 


!/i + l 


(25) 


and the normal equations are The parameters {v, Pi, 132, ■■■, /?«) in the above log- 
likelihood function can be estimated by maximizing the log-likelihood function 
for given dataset using optim( ) function in R-program, and the initial values 
of the parameters were chosen from Poisson regression model. 
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6.3 Illustrative example 

The US National Medical Expenditure Survey 1987/88 (NMES) data were con¬ 
sidered which can be obtained from Journal of Applied Econometrics, 1997 Data 
Archive at http://qed.econ.queensu.ca/jae/1997-vl2.3/ deb-Trivedi/. This data 
were originally used by Deb and Trivedi(1997) in their analysis of various mea¬ 
sures of healthcare utilization. The data set consists of 4406 individuals covered 
by Medicare, the USA public insurance programme. Just for illustration, we 
are considering the Here we model, the number of stays after hospital admission 
(HOSP) as the response variable, because of the fact that, it have over-dispersed 
as well as large proportion of zero. The details and the summary statistics of 
this response variable as well as the set of explanatory variables is given in table 
(2). Here the mean and variance of the response variable indicate the over¬ 
dispersion as well as existence of large number of zeros. Hence it is adequate to 
apply the our model for the presented dataset. 


Table 2: Summary Statistics of Response and Explanatory Variables 


Variable 

Measurement 

Mean 

std dev 

OFP 

Number of physician visit 

5.774 

6.759 

HOSP 

Number of hospital stays 

0.296 

0.746 

POORHLTH 

Self-perceived health status, 
poor=l, else=0 

0.13 

0.33 

EXCLHLTH 

Self-perceived health status, 
excellent =1, else = 0 

0.08 

0.27 

NUMCHRON 

Number of chronic conditions 

1.54 

1.35 

MALE 

Gender; male = 1, else =0 

0.404 

0.491 

SCHOOL 

Number of year of education 

10.29 

3.739 

PRIVINS 

Private insurance indicator, yes =1, no = 0 

0.776 

0.417 


In Table (2), the maximum likelihood estimates of Poisson (Pois) regression 
Model, Negative Binomial (MB) Regression model and Poisson-Transmuted 
Exponential(7^Tf) Regression Model, including the intercept /3i (the regres¬ 
sion estimate when all variables in the model are evaluated at zero) were pre¬ 
sented. For comparability of these models we use the value of the maximum 
log-likelihood function {Imax) and Akaike information criterion (AIC) defined 
as —2lmax + 2fc, where k be the no. of parameters in the regression model. 
These values are shown in Table (3). 
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Table 4: log-likelihood (Imax) and Akaike information criterion (AIC) Value 



Regression Model 

Criterion 

Pois 

NB 

VTS 

LLrnax 

AIC 

-17971.50 

35959 

-12170.55 

24359 

-10521.89 

21061.78 


Further, it is observed that the estimates of all parameters are found significant 
at 5% level of signihcance as p-value for all the parameter estimates are less then 
5%level of signihcance. It can be clearly seen from table that the log-likelihood 
value of VTS regression model is highest, whereas AIC also indicate the better 
ht of VT£ regression model compared to other two models. 

7 Comments 

In this paper we have proposed a new mixed Poisson distribution and derived 
its distributional properties. Actuarial application to risk modelling were pre¬ 
sented. Moreover, htting of VTST) indicates the flexibility and capacity of the 
proposed distribution in data modeling of count data and also in count regres¬ 
sion including covariate set up. It has been shown that this distribution out 
performs other competing distributions. 
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